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Chapter  1:  Introduction 


1.1  What  is  DAML,  OIL ,  the  Semantic  Web? 

DAML  is  the  United  States  Defense  Advanced  Research  Projects  Agency’s  (DARPA) 
Agent  Markup  Language.  It  is  an  effort  to  define  a  Web-based  machine-understandable 
knowledge  representation.  OIL,  the  Ontology  Inference  Layer,  is  a  European  sister  effort. 
DAML+OIL  merges  the  languages  into  a  single  proposed  standard. 

Both  these  efforts  are  part  of  an  attempt  to  evolve  the  current  World-Wide -Web  (W3) 
into  the  Semantic  Web  in  which  machines  can  read  content  without  “Web-scaping”  or 
“wrappering”  Web  site  content  by  hand. 


http://www.sciam.com/2001/0501issue/0501bernerslee.html  is  an  excellent  general- 
audience  introduction  to  the  concept  of  the  Semantic  Web. 


1.2  What  is  WebScripter? 


WebScripter  is  a  tool  that  allow  ordinary  users  to  make  reports  of  DAML  data  distributed 
over  the  Web.  You  can  add  columns  by  browsing  any  DAML  data  in  a  hierarchy  outline 
and  then  add  data  in  a  new  column  or  to  an  existing  column.  While  you  are  doing  that, 
WebScripter  records  which  ontological  classes  and  fields  were  used  together  (in  a  DAML 
format),  so  that  others  can  draw  from  that  when  making  their  reports  (“people  who  used 
ISLLastName  also  used  Oxford:Surname  and  Karlsruhe :Vorname”).  Thus,  WebScripter 
also  offers  no-cost  implicit  collaborative  ontology  translation. 


1.3  Why  would  I  want  to  use  WebScripter? 


Using  DAML  and  WebScripter,  you  can  publish  data  once  and  then  republish  it  many 
times.  In  a  large  organization,  you  can  allow  your  individual  units  to  publish  their  data  in 
a  format  that  makes  sense  to  them,  yet  integrate  it  into  seamless  reports  using 
WebScripter.  You  can  break  out  a  data  entry  task  both  “horizontally”  (different  units 


1 


supply  different  instances/rows),  or  “vertically”  (different  units  supply  different 
attributes/columns  for  the  same  instances),  or  any  combination  thereof. 


1.4  What  are  examples  of  WebScripter  use? 

For  example,  point  your  browser  to  http://www.isi.edu/divisions/div2 
and  click  on  People. 

What  you’re  seeing  is  the  output  of  a  WebScripter  report  (albeit  formatted  with  a  custom 
XSL  stylesheet).  For  example,  the  information  about  the  office  locations  and  phone 
numbers  is  maintained  by  the  ISI  business  office  while  the  matrix  of  who  works  on  which 
projects  is  maintained  by  our  Division,  and  the  individual  interests  are  maintained  by 
individuals.  WebScripter  fuses  all  this  information  together  into  a  coherent  report. 


Scroll  all  the  way  down  on  the  page  and  click  on  “report”  in  the  sentence  “This  page  is 
automatically  generated  and  maintained  based  on  a  Webscripter  report  that  fetches  and 
fuses...”  and  you  click  on  that.  What  you  are  seeing  is  the  standard  look  of  a  WebScripter 
report.  The  color  coding  indicates  the  source  of  the  DAML  data.  The  polished  People 
page  is  made  by  a  simple  XSL  transformation  of  the  XML  version  of  this  WebScripter 
report. 


1.5  How  can  I  obtain  WebScripter? 

Go  to  http://www.isi.edu/webscripter  and  click  on  Download.  This  gives  you  everything 
you  need  to  run  WebScripter  and  make  reports,  including  example  reports,  and  this 
manual. 

You  can  obtain  the  complete  source  codeof  WebScripter  by  signing  a  license  agreement 
with  us. 

You  can  also  use  the  WebScripter  servlet  service  at  to  run  WebScripter  reports  without 
installing  any  software.  Go  to  http://www.isi.edu/webscripter  and  click  on  “as  Web 
Service”. 
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Chapter  2:  Installation 


The  only  requirement  is  a  platfonn  that  supports  the  Sun  Microsystem  Java  Development 
Kit  (JDK)  1 .4.0  or  higher  (note  that  we  know  for  sure  that  JDK  1.3.1  will  not  work). 

Say  “jar  xvf  webscripterdistribution.jar”  if  you  have  the  JDK  bin  directory  in  your  path, 
or  something  like  “C:/jdk  1.4. 0/bin/jar  xvf  webscripterdistribution.jar”  otherwise.  If  you 
are  on  a  PC,  you  can  alternatively  use  WinZip  to  unpack  the  WebScripter  distribution. 

Either  way  you  will  end  up  with  a  directory  called  webscripter  on  your  hard  disk.  If  you 
are  on  a  PC  and  you  have  the  JDK  bin  directory  in  your  path  you  should  be  able  to 
simply  double-click  on  “runinteractivewindows.bat”  in  the  webscripter  directory  to  start 
WebScripter.  On  a  PC,  you  can  also  create  a  shortcut  to  that  file  and  place  the  shortcut  on 
your  desktop  as  a  convenience. 

If  you  are  running  on  a  Macintosh  or  Unix  variant,  copy  and  modify  the 
“runinteractivewindows.bat”  file  to  follow  the  platform’s  conventions  (for  example,  Unix 
uses  colons  to  separate  CLASSPATH  entries  rather  than  the  semicolons  used  by 
Windows). 
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Chapter  3:  Tutorial 


This  part  of  the  manual  describes  how  to  use  the  graphical  version  of  WebScripter.  (The 
Reference  chapter  explains  how  to  work  with  the  textual  XML  format  of  a  WebScripter 
report  definition,  which  is  more  cumbersome  but  also  more  powerful,  and  can  also  be 
used  to  rerun  reports  you  defined  interactively  from  the  command  line.) 


3.1  Generating  a  simple  WebScripter  report 

Start  the  graphical  version  of  WebScripter  as  described  in  Chapter  2.  You  should  see  a 
window  with  a  blank  spreadsheet. 

Load  the  URL  http://www.isi.edu/webscripter/div2people.qen.html  into  your  favorite 
Web  browser.  What  you  are  looking  at  is  the  HTML  output  of  a  previous  run  of  Web 
scripter.  However,  the  content  is  also  embedded  as  DAML  into  the  Web  page.  (Say 
“View  Source”  in  your  Web  browser  and  scroll  towards  the  end.) 

Thus,  you  can  use  the  embedded  DAML  in  that  page  for  further  processing.  Choose 
“Example  Report”  from  the  file  menu.  You  should  see  an  outline  of  a  simple  concept 
hierarchy  on  the  left,  and  the  content  of  a  report  in  the  upper  right.  Left  click  on 
Div2Role. 


Now,  let’s  be  more  selective  about  which  data  we  want  to  appear  in  our  report.  Choose 
“New  Report”  from  the  File  menu.  Go  to  any  field  in  the  second  or  subsequent  row  of 
column  B  and  type  “Martin”.  Highlight  the  field  by  first  clicking  on  another  cell  and  then 
clicking  on  the  “Martin”  cell  (you  should  see  a  border  around  it),  then  click  on  the 
Classify  button.  Following  are  some  example  screen  shots: 
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A  popup  window  will  appear  which  most  likely  will  give  you  only  a  single  choice 
which  is  “firstname”.  Choose  it  and  click  on  “Set”.  You  now  have  a  report  which  lists 
our  first  names  and  nothing  else.  WebScripter  has  automatically  classified  the  data  you 
typed  to  the  DAML  attribute  URI  “http://www.isi.edu/webscripter/div2- 
org .  o .  daml#firstname”. 


Classifying  is  just  one  way  of  adding  columns,  for  one  of  a  single  ontological  type  it  is 
actually  easier  to  do  this:  click  on  Div2Member,  highlight  a  field  in  the  “firstname” 
column,  and  say  choose  “new  report  column”  from  the  right  mouse  button  menu  for  the 
same  result. 


Scroll  down  and  type  “Frank”  in  column  C  in  a  blank  row,  select  the  cell,  and  click  on 
Classify  again.  You  will  probably  have  a  whole  bunch  of  matches  in  the  popup  window 
now  because  “frank”  is  my  ISI  email  prefix  and  used  as  a  key  for  many  attributes. 
Choose  “lastname”  from  the  choices  given.  Our  last  names  now  appear  as  well  in 
column  C. 


So  far,  we  have  only  drawn  data  from  one  data  source,  the  Information  Sciences 
Institute’s.  What  makes  WebScripter  and  DAML  in  general  exciting  is  that  you  can 
draw  in  and  mix  data  from  many  different  sources. 


Load  http://www.aifb.unikarlsruhe.de/WBS/sha/index.html  as  a  DAML  file.  Load 
http://wwwdb.stanford.edu/~yhjin/  as  a  DAML  file.  (You  can  find  both  of  these  people 
and  more  listed  under  http://wwwdb.stanford.edU/Ontoagents/#participants.) 


Now  type  “Siegfried”  in  an  empty  cell  of  column  B,  select  it,  and  Classify  it;  select 
“firstName”  from  the  popup  window  and  click  “Set”.  Type  “Jin”  in  an  empty  cell  of 
column  C,  select  it,  and  Classify  it;  select  “lastName”. 

The  below  instructions  rely  on  external  data  not  maintained  by  us;  so  we  can’t  guarantee 
that  the  pages  won’t  have  disappeared  etc.  by  the  time  you  read  this. 
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Both  Siegfried  and  Yuhui  now  appeared  in  the  list  of  people’s  names. 

Finally,  click  in  the  yellow  header  column,  and  you  will  see  a  popup  window  that  lists  the 
URIs  of  DAML  properties  that  go  into  this  column.  You  have  also  implicitly  declared 
that  you  regard  those  two  properties  as  equivalent  at  least  for  the  purpose  of  this  report; 
WebScripter  will  write  out  that  fact  as  DAML  “samePropertyAs”  statements  which  can 
later  benefit  yourself  and  others  by  WebScripter  offering  to  incorporate  all  of  those 
whenever  any  of  them  are  used.  (More  about  that  in  a  later  chapter.) 

Choose  “Mapping  Info”  from  the  Output  menu  to  view  this  equivalency  information  in 
DAML. 

Choose  “As  HTML”  from  the  Output  menu  to  see  the  standard  HTML  format  of  the 
report  you  just  created. 

Choose  “Report  Definition”  from  the  Output  menu  to  obtain  the  WebScripter  report 
definition  “wsrd”  file;  it  is  in  XML  format,  and  you  can  refer  to  Chapter  4  to  edit  the 
XML  by  hand  (for  example,  this  lets  you  sort  rows,  restrict  rows  by  content,  and  perform 
other  operations  that  can  currently  not  yet  be  performed  interactively). 

Finally,  WebScripter’s  “Easy  Publish”  feature  in  the  “Windows”  menu  will  let  you  save 
your  WebScripter  report  definition  on  an  ISI  server,  and  will  return  a  URL  which  will 
dynamically  recomputes  the  report  content  whenever  anyone  accesses  it.  If  either  you 
have  no  Internet  connection  or  our  servlet  service  is  down  WebScripter  will  go  blank  for 
20  seconds  or  so,  then  return  to  life  with  a  Java  exception  printed  to  standard  error  (which 
you  can  safely  ignore). 

Easy  Publish  gives  you  a  quick  way  to  play  with  WebScripter  and  share  the  results  with 
others.  You  should  obviously  (a)  not  use  the  Easy  Publish  feature  for  production  level 
content  that  is  mission  critical  to  your  organization  [we  make  no  guarantees  that  the 
servlet  will  remain  available  forever],  and  (b)  not  use  it  to  publish  reports  of  content  you 
want  to  keep  secret  [because  anyone  can  see  any  Easy  Publish  report]. 
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Chapter  4:  Reference 


The  interactive  version  of  WebScripter  is  now  a  solid  way  of  constructing  and 
maintaining  simple  reports,  but  you  can  also  edit  report  definitions  manually,  and  then 
load  them  back  into  interactive  WebScripter,  or  you  can  run  a  WebScripter  report  via  the 
following  command  line  if  you  are  in  the  WebScripter  distribution  directory  that  contains 
all  of  the  Jar  files.  (On  a  PC,  on  other  platforms,  you  may  have  to  replace  the  semicolons 
with  colons  and  so  forth.) 

Java  -classpath 

’  j  ena.j  ar;wsall.j  ar  ;rdf-api-200 1  -0 1  - 1 9 .j  ar;xerces  .j  ar;xalan.j  ar  ;gnu-regexp- 1 .0 . 8  .j  ar  ’ 
Dorg.xml.sax.parser=org.apache.xerces.parsers.SAXParser 
edu.isi.dce.webscripter.report.main.WsMnMain 
-parser  jena 

-definition  http://www.isi.edu/webscripter/div2people.wsrd 
-printAllTriples  false 
-htmlOutput  example.gen.html 
-xmlOutput  example. gen.xml 

About  the  command  line  arguments:  the  parser  argument  is  either  jena  or  rdfapi,  latter  is 
only  recommended  for  backward  compatibility;  -printAllTriples  will  do  just  that  after 
loading  all  your  DAML  input  examining  that  is  the  only  sensible  way  to  make 
WebScripter  reports  of  pure  RDF  data  with  neither  a  DAML  nor  an  RDFS  schema  behind 
it  because  it  will  not  show  you  a  class  hierarchy  on  the  left  in  that  case. 

WebScripter  reports  explicitly  specify  the  DAML  data  source  pages  that  will  be  loaded  to 
produce  the  report.  (Additional  pages  can  also  be  loaded  dynamically;  more  about  that 
later.)  In  the  future,  explicitly  specifying  source  pages  may  not  always  be  necessary  by 
using  a  web-crawler  that  knows  where  to  find  DAML  content. 

The  data  that  results  from  instantiating  a  WebScripter  report  definition  is  always  in 
tabular  fonn.  (Note  that  it  is  not  in  ’’relational”  form  in  the  database  sense  because  it  is 
not  in  First  Normal  Fonn  -  a  single  cell  in  the  table  can  have  multiple  values.) 

The  first  column  of  a  table  is  special  because  it  defines  which  rows  will  appear  in  the 
report;  we  call  it  the  Generator  Column.  It  is  the  seed  of  all  other  content  in  the  table. 

The  second  and  all  subsequent  columns  are  Extractor  Columns.  They  are  computed 
from  the  cell  content  of  a  specified  earlier  column  and  a  transformation.  RDF  triples 
have  a  subject  predicate  object  structure;  the  cell  content  of  an  extractor  column  is 
computed  by  collecting  all  objects  that  match  the  column’s  predicate  for  the  subjects 
in  the  source  cell;  this  can  also  be  reversed  by  setting  Islnverse  to  true. 

Sometimes  you  want  to  have  certain  columns  in  your  report  even  though  you  don’t 
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want  to  show  then  to  the  end  users.  For  example,  you  can  use  a  column  for  sorting  or 
filtering  the  rows  of  the  report  but  hide  the  column  itself.  You  can  do  that  for  any 
column  by  setting  the  Visible  tag  to  false. 

You  can  sort  by  one  or  more  columns,  ascending  or  descending. 


You  can  also  filter  out  rows.  Every  cell  in  a  WebScripter  report  can  in  principle  have 
multiple  values.  Thus,  filter  expressions  operate  on  list  valued  cells,  even  if  you 
believe  that  there  should  always  be  at  most  one  value  for  a  cell.  (But  you  should  plan 
for  the  case  that  there  are  multiple  values;  for  example,  we  initially  expected  to-do 
items  to  relate  to  exactly  one  project,  but  it  later  turned  out  that  it  made  perfect  sense 
that  some  to-do  items  such  as  Meta-Ja  improvements  apply  to  multiple  projects.)  Here 
are  some  simple  examples. 

<KeepRowIf>contains(.*2002.*)</KeepRowIf> 

<KeepRowIf>contains(http://www.isi.edu/~frank/metaja)</KeepRowIf> 

<KeepRowIf>doesNotContain(\#ClosureReport)</KeepRowIf> 

<KeepRowIf>doesNotContain(szekely|rneches)</KeepRowIf> 

The  expression  within  the  brackets  is  a  GNU  regular  expression,  which  means  you 
can  use  sophisticated  wild  cards.  A  good  overview  of  GNU  regular  expressions  is  at 
http://www.emerson.emory.edu/services/editors/ne/Regular  Expressions.html  (this  is  an 
external  site  which  we  have  no  control  over  try  “GNU  regular  expression”  in  a  search 
engine  it  does  not  work). 


Finally,  note  that  you  can  load  from  URLs  which  do  not  start  with  “http:”  In  particular, 
you  can  use  “file:x.daml”  to  load  local  file  x.daml,  presuming  such  a  file  exists  in  your 
WebScripter  directory  (the  one  that  contains  wsall.jar  ).  You  can  probably  also  load  from 
subfolders  using  “file:subfolder/x.daml”  syntax  and  from  superfolders  using 
“file:../../whatever/x.daml”  syntax,  but  that  is  specific  to  your  operating  system  and  Java 
runtime  implementation. 


4.1  Single-ontology  reports 

Here  is  how  I  keep  to-do  item  lists  on  the  Meta-Ja  project 
(http : // www .  isi .  edu/  frank/ metaj  a/) . 

First,  here  is  the  list  of  open  items.  In  a  nutshell,  the  report  lists  all  to-do  items 
that  (a)  refer  to  the  Meta-Ja  project,  and  that  (b)  are  not  referred  to  by  any 
closure  report. 

http://www.isi.edu/webscripter/metajatodo.gen.html 

<Definition> 

<ReportName>Meta-Ja  Desiderata</ReportName>  <Description> 
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These  items  describe  potential  enhancements  to  Meta-Ja  that  I  believe  are  worthwhile  in  principle.  The  existence  of 
items  on  this  list  does  not  imply  that  I  promise  to  ever  implement  any  of  them  not  even  if  you  have  paid  a  commercial 
Meta-Ja  licensee  fee 

(because  maintaining  MetaJa  is  not  part  of  my  regular  job  the  licensing  fees  are  not  nearly  enough  to  cover  my  time). 
</Description> 

<GeneratorColumn> 

<ld>ld</ld> 

<Visible>true</Visible> 

<GeneratorSources> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.  isi.edu/webscripter/todo.  gen.  d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/mrftodoaddenda.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

</GeneratorSources> 

<Classldentifiers><String>http://www.isi.edu/webscripter/todo.o.daml#ToDoltem</String></Classldentifiers> 

</GeneratorColumn> 

<ExtractorColumns> 

<ExtractorColumn> 

<ld>ToDo  ltem</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#label</Uri></Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Project</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#project</Uri></Predicate></Predicates> 

<KeepRowlf>contains(http://www.isi.edu/~frank/metaja)</KeepRowlf> 

<Visible>false<A/isible> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Priority</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#priority</Uri></Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Description</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#description</Uri></Predicate></Predicate 

s> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Entered</ld>  <AnchorColumnld>ld</AnchorColumnld> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#entered</Uri></Predicate></Predicates> 
</ExtractorColumn> 

<ExtractorColumn> 

<ld>Annotations</ld> 

<Visible>false<A/isible> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates><Predicate> 

<Uri>http://www.isi.edu/webscripter/todo.o.daml#todoitemid</Uri> 

<lnverse>true</lnverse> 

</Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Annotation  Types</ld> 

<Visible>false<A/isible> 

<AnchorColumnld>Annotations</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.w3.org/1999/02/22rdfsyntaxns#type</Uri></Predicate></Predicates> 

<KeepRowlf> 

doesNotContain(http://www.isi.edu/webscripter/todo.o.daml#ClosureReport) 

</KeepRowlf> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Comment  Dates</ld> 

<Visible>false<A/isible> 

<AnchorColumnld>Annotations</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#entered</Uri></Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Comments</ld>  <Visible>false</Visible>  <AnchorColumnld>Annotations</AnchorColumnld> 
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<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#content</Uri></Predicate></Predicates> 

</ExtractorColumn> 

</ExtractorColumns> 

<Sortinglnstructions> 

<Sortinglnstruction> 

<Columnld>Entered</Columnld> 

<Ascending>false</Ascending> 

</Sortinglnstruction>  </Sortinglnstructions> 

</Definition> 


Second,  here  is  the  WebScripter  report  definition  of  closed  items.  It  lists  to  do  items  that 
refer  to  the  MetaJa  project  and  that  do  have  a  closure  report  associated  with  them. 
http://www.isi.edu/webscripter/metajadone.gen.html 

<Definition> 

<ReportName>MetaJa  Enhancements</ReportName>  <Description> 

Here  is  a  list  of  what  I  recently  did  on  MetaJa  (listed  by  closure  date,  most  recent  first). 

</Description> 

<GeneratorColumn> 

<ld>ToDo  ld</ld> 

<Visible>false<A/isible> 

<GeneratorSources> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www. isi.edu/webscripter/todo. gen. d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/mrftodoaddenda.d.daml</UrlOfAnRdfFile> 

</GeneratorSource>  </GeneratorSources> 

<Classldentifiers><String>http://www.isi.edu/webscripter/todo.o.daml#ToDoltem</String></Classldentifiers> 

</GeneratorColumn> 

<ExtractorColumns> 

<ExtractorColumn> 

<ld>ToDo  ltem</ld> 

<AnchorColumnld>ToDo  ld</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#label</Uri></Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Project</ld>  <AnchorColumnld>ToDo  ld</AnchorColumnld>  <Visible>false</Visible> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#project</Uri></Predicate></Predicates> 
<KeepRowlf>contains(http://www.isi.edu/~frank/metaja)</KeepRowlf> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Privacy  Setting</ld>  <AnchorColumnld>ToDo  ld</AnchorColumnld> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#privacysetting</Uri></Predicate></Predic 
ates>  <KeepRowlf>doesNotContain(. ‘private. *)</KeepRowlf>  <Visible>false</Visible> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Description</ld>  <AnchorColumnld>ToDo  ld</AnchorColumnld> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#description</Uri></Predicate></Predicate 
s> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Entered</ld>  <AnchorColumnld>ToDo  ld</AnchorColumnld> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#entered</Uri></Predicate></Predicates> 
</ExtractorColumn> 

<ExtractorColumn> 

<ld>Annotations</ld> 

<Visible>false</Visible> 

<AnchorColumnld>ToDo  ld</AnchorColumnld> 

<Predicates><Predicate> 

<Uri>http://www.isi.edu/webscripter/todo.o.daml#todoitemid</Uri> 

<  I  n  verse>true</l  n  verse> 

</Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Annotation  Types</ld> 

<Visible>false</Visible> 

<AnchorColumnld>Annotations</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.w3.org/1999/02/22rdfsyntaxns#type</Uri></Predicate></Predicates> 

<KeepRowlf> 

contains(http://www.isi.edu/webscripter/todo.o.daml#ClosureReport) 

</KeepRowlf> 
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</ExtractorColumn> 

<ExtractorColumn> 

<ld>Closed</ld> 

<AnchorColumnld>Annotations</AnchorColumnld> 

<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#entered</Uri></Predicate></Predicates> 

</ExtractorColumn> 

<ExtractorColumn>  <ld>Closure  Remark </ld>  <AnchorColumnld>Annotations</AnchorColumnld> 
<Predicates><Predicate><Uri>http://www.isi.edu/webscripter/todo.o.daml#remark</Uri></Predicate></Predicates> 
</ExtractorColumn> 

</ExtractorColumns> 

<Sortinglnstructions> 

<Sortinglnstruction> 

<Columnld>Entered</Columnld> 

<Ascending>false</Ascending> 

</Sortinglnstruction>  </Sortinglnstructions> 

</Definition> 
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4.2  A  multiontology  report 


The  report  below  is  interesting  because  it  draws  data  from  multiple  DAMLized  Web 
pages  that  use  multiple  ontologies  and  whose  authors  did  not  consult  each  other  when 
marking  up  their  pages. 

http://www.isi.edu/webscripter/damlpersonnel.gen.html 

<Definition> 

<ReportName>DAML  Personnel</ReportName>  <Description> 

An  attempt  to  pull  together  as  much  information  about  DAML  personnel  as  I 
(Martin  Frank)  can  find  data  for,  using  WebScripter. 

</Description> 

<GeneratorColumn> 

<ld>ld</ld> 

<Visible>true<A/isible> 

<GeneratorSources> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://wwwdb. Stanford. edu/Ontoagents/#participants</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://wwwdb. Stanford. edu/~stefan/#person_stefan_decker</UrlOfAnRdfFile> 
</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.aifb.unikarlsruhe.de/WBS/sha/index.html#person:sha</UrlOfAnRdfFile> 
</GeneratorSource>  <GeneratorSource> 

<UrlOfAnRdfFile>http://wwwdb. Stanford. edu/~melnik/index.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://wwwdb. Stanford. edu/~yhjin</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.aifb.unikarlsruhe.de/~sst</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.ksl. Stanford. edu/people/dlm/</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://kslweb. Stanford. edu/people/sam/</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.cs.umd.edu/~hendler/jhendler.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.cs.cmu.edu/~softagents/atlas/daml_pages/payne_terence.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource>  <UrlOfAnRdfFile>http://www. semanticweb.org/ontologies/swrc- 
onto-2000-09-1 0.daml</UrlOfAnRdfFile>  </GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/div2org.o.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/div2roster.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFiie>http://www.isi.edu/webscripter/div2projects.gen.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/div2roles.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<CrawlDepth>1</CrawlDepth> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/div2registry.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 
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<UrlOfAnRdfFile>http://www.isi.edu/webscripter/div2display.d.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

<GeneratorSource> 

<UrlOfAnRdfFile>http://www.isi.edu/webscripter/damlpersonneladdenda.daml</UrlOfAnRdfFile> 

</GeneratorSource> 

</GeneratorSources> 

<Classldentifiers> 

<String>http://www.isi.edu/webscripter/div2org.o.daml#Div2Member</String> 

<String>http://www.semanticweb.org/ontologies/swrconto20000910.daml#Person</String> 

<String>http://www.semanticweb.org/ontologies/swrconto20000910.daml#FullProfessor</String> 

<String>http://www.w3.org/1999/02/22rdfsyntaxns#FacultyMember</String> 

<String>http://www.semanticweb.org/ontologies/swrconto20000910.daml#PhDStudent</String> 

<!prevent  duplicate  entry  for  Stefan  http://www.semanticweb.Org/ontologies/swrconto20000910.daml#AcademicStaff 
> 

<String>http://www.semanticweb.org/ontologies/swrconto20000910.daml#Lecturer</String> 

<String>http://www.ksl. Stanford. edu/projects/DAML/ksldamldesc.daml#Person</String> 

<String>http://ksl. Stanford. edu/Projects/DAML/ksldamldesc.daml#PERSON</String> 

<String>http://ksl. stanford.edu/Projects/DAML/ksldamldes  c.daml#PERSON</String> 

</Classldentifiers> 

</GeneratorColumn> 

<ExtractorColumns> 

<ExtractorColumn> 

<ld>Name</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#name</Uri></Predicate> 
<Predicate><Uri>http://www.ksl. Stanford. edu/projects/DAML/ksldamldesc.daml#hasFullName</Uri></Predicate> 
<Predicate><Uri>HasFullName</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorCoiumn> 

<ld>First</ld> 

<AnchorColumnld>ld</AnchorCoiumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#firstname</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#firstName</Uri></Predicate> 

</Predicates> 

<KeepRowlf>doesNotContain(Jeannine)</KeepRowlf> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Last</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#lastname</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#lastName</Uri></Predicate> 

</Predicates> 

<KeepRowlf>doesNotContain(Park|Balasundaram|Ko|Bugacov|Wang|Salemi|Benjamin|Will|Yao|Seshadri|Kim|Chen| 
Medvidovic|MacGregor|Eisenstein|Eleish)</KeepRowlf>  </ExtractorColumn>  <ExtractorColumn> 

<ld>Affiliation</ld> 

<AnchorColumnld>ld</AnchorCoiumnld> 

<Predicates> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#affiliation</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Role</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#role</Uri></Predicate> 

<Predicate><Uri>HasResearchPosition</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Email</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#preferredEmailAddress</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#email</Uri></Predicate> 
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<Predicate><Uri>http://www.cs.umd.edu/projects/plus/DAML/onts/personal1.0.daml#emailAddress</Uri></Predic 

ate> 

<Predicate><Uri>HasEmailAddress</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Phone</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#phonenumber</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#phone</Uri></Predicate> 

<Predicate><Uri>HasPhoneNumber</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>lnterests</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#interests20words</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Home  Page</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#homepage</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#homepage</Uri></Predicate> 

<Predicate><Uri>HasHomePage</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Project  IDs</ld> 

<AnchorColumnld>ld</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.isi.edu/webscripter/div2org.o.daml#workson</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto200009- 

10.daml#worksAtProject</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#headOf</Uri></Predicate> 
<Predicate><Uri>http://www.cs.  umd.edu/projects/plus/DAML/onts/generall  .O.daml#head</Uri></Predicate> 
</Predicates> 

</ExtractorColumn> 

<ExtractorColumn> 

<ld>Project  Nicknames</ld> 

<AnchorColumnld>Project  IDs</AnchorColumnld> 

<Predicates> 

<Predicate><Uri>http://www.w3.org/2000/01/rdfschema#label</Uri></Predicate> 

<Predicate><Uri>http://www.semanticweb.org/ontologies/swrconto20000910.daml#title</Uri></Predicate> 

</Predicates> 

</ExtractorColumn> 

</ExtractorColumns> 

<Sortinglnstructions> 

<Sortinglnstruction> 

<Columnld>Last</Columnld> 

<Ascending>true</Ascending> 

</Sortinglnstruction>  </Sortinglnstructions> 

</Definition> 
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Chapter  5:  Background  and  Pointers 


Here  are  the  normative  references  to  the  languages  that  WebScripter  uses. 


5.1  XML 

http://www.w3  .org/XML/ 


5.2  RDF 

http://www.w3.org/RDF/ 


5.3  RDF  Schema 

http://www.w3.org/TR/2000/CRrdfschema2000Q327/ 


5.4  DAML 

http://www.daml.org/2001/03/daml+oilindex.html 
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Appendix  A 


Revision  History 

From  1.1  July  10th  to  1.2  July  25th,  2001:  added  installation  instructions  for  binary 
distribution 

•  1.3  August  16,  2001:  added  more  examples  and  generally  cleaned  up  in 
preparation  for  first  publicly  downloadable  release 

•  2.0  February  9th,  2002:  complete  rewrite  of  internals,  based  on  Jena  DAML 
parser  by  default  now,  12  orders  of  magnitude  faster  report  processing,  can  handle 
10+  MB  of  DAML  input  files,  you  no  longer  have  to  list  subclasses  explicitly  to 
pull  them  in,  GUI  enhancements:  sorting  by  multiple  columns,  moving  columns, 
adding/deleting/modifying  column  definitions  and  DAML  input  file  list,  loading 
with  crawl  depth  of  1  from  the  GUI 

•  2.2  May  31st,  2002:  large  number  of  improvements,  including  guessing  ok 
linkages  between  classes  and  offering  a  menu  of  that  to  the  user;  also  sorting  and 
ordering  of  the  class  hierarchy  by  various  criteria;  plain  HTML  pages  can  now  be 
used  as  a  jumpoff  point  for  loading  RDF  pages  it  points  to  (ISWC’2002  support); 
can  now  parse  embedded  RDF  out  of  HTML  pages  with  Jena 
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